首页> 外文OA文献 >Multiobjective Evolutionary Optimization of Type-2 Fuzzy Rule-Based Systems for Financial Data Classification
【2h】

Multiobjective Evolutionary Optimization of Type-2 Fuzzy Rule-Based Systems for Financial Data Classification

机译:基于2类模糊规则的金融数据分类系统的多目标进化优化

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Classification techniques are becoming essential in the financial world for reducing risks and possible disasters. Managers are interested in not only high accuracy, but in interpretability and transparency as well. It is widely accepted now that the comprehension of how inputs and outputs are related to each other is crucial for taking operative and strategic decisions. Furthermore, inputs are often affected by contextual factors and characterized by a high level of uncertainty. In addition, financial data are usually highly skewed toward themajority class. With the aim of achieving high accuracies, preserving the interpretability, and managing uncertain and unbalanced data, this paper presents a novel method to deal with financial data classification by adopting type-2 fuzzy rule-based classifiers (FRBCs) generated from data by a multi-objective evolutionary algorithm (MOEA). The classifiers employ an approach, denoted as scaled dominance, for defining rule weights in such away to help minority classes to be correctly classified. In particular, we have extended PAES-RCS, an MOEA-based approach to learn concurrently the rule and data bases of FRBCs, for managing both interval type-2 fuzzy sets and unbalanced datasets. To the best of our knowledge, this is the first work that generates type-2 FRBCs by concurrently maximizing accuracy and minimizing the number of rules and the rule length with the objective of producing interpretable models of real-world skewed and incomplete financial datasets. The rule bases are generated by exploiting a rule and condition selection (RCS) approach, which selects a reduced number of rules from a heuristically generated rule base and a reduced number of conditions for each selected rule during the evolutionary process. The weight associated with each rule is scaled by the scaled dominance approach on the fuzzy frequency of the output class, in order to give a higher weight to the minority class. As regards the data base learning, the membership function parameters of the interval type-2 fuzzy sets used in the rules are learned concurrently to the application of RCS. Unbalanced datasets are managed by using, in addition to complexity, selectivity and specificity as objectives of the MOEA rather than only the classification rate. We tested our approach, named IT2-PAES-RCS, on 11 financial datasets and compared our results with the ones obtained by the original PAES-RCS with three objectives and with and without scaled dominance, the FRBCs, fuzzy association rule-based classification model for high-dimensional dataset (FARC-HD) and fuzzy unordered rules induction algorithm (FURIA), the classical C4.5 decision tree algorithm, and its cost-sensitive version. Using nonparametric statistical tests, we will show that IT2-PAES-RCS generates FRBCs with, on average, accuracy statistically comparable with and complexity lower than the ones generated by the two versions of the original PAES-RCS. Further, the FRBCs generated by FARC-HD and FURIA and the decision trees computed by C4.5 and its cost-sensitive version, despite the highest complexity, result to be less accurate than the FRBCs generated by IT2-PAES-RCS. Finally, we will highlight how these FRBCs are easily interpretable by showing and discussing one of them.
机译:分类技术对于减少风险和可能的灾难在金融界正变得至关重要。管理人员不仅对准确性感兴趣,而且对可解释性和透明度也很感兴趣。现在已经被广泛接受,对于输入和输出如何相互关联的理解对于做出运营和战略决策至关重要。此外,投入通常受上下文因素影响,并具有高度不确定性。此外,财务数据通常偏向多数阶级。为了实现较高的准确性,保留可解释性以及管理不确定和不平衡的数据,本文提出了一种新的方法,该方法采用多类型从数据生成的基于2型模糊规则的分类器(FRBC)来处理财务数据分类。目标进化算法(MOEA)。分类器采用一种表示比例优势的方法来定义规则权重,以帮助少数群体正确分类。特别是,我们扩展了PAES-RCS,这是一种基于MOEA的方法,可以同时学习FRBC的规则和数据库,以管理区间2型模糊集和不平衡数据集。据我们所知,这是第一个通过同时提高准确性,最小化规则数量和规则长度,以生成可解释的真实世界中不完整的财务数据集模型的目的而生成2类FRBC的工作。通过利用规则和条件选择(RCS)方法生成规则库,该方法从启发式生成的规则库中选择数量减少的规则,并在进化过程中为每个选定规则选择数量减少的条件。与每个规则相关的权重通过对输出类别的模糊频率的比例控制方法进行缩放,以使少数类别获得更高的权重。关于数据库学习,在规则中使用的间隔类型为2的模糊集的隶属函数参数是在RCS的应用中同时学习的。除复杂性,选择性和特异性外,不平衡数据集还通过MOEA的目标而不仅仅是分类率来管理。我们在11个财务数据集上测试了称为IT2-PAES-RCS的方法,并将我们的结果与原始PAES-RCS获得的结果(具有三个目标且有和没有规模优势),FRBC,基于模糊关联规则的分类模型进行了比较用于高维数据集(FARC-HD)和模糊无序规则归纳算法(FURIA),经典C4.5决策树算法及其成本敏感版本。使用非参数统计检验,我们将显示IT2-PAES-RCS生成的FRBC平均而言在统计学上可与原始PAES-RCS的两个版本生成的FRBC相比,且其复杂度要低。此外,尽管复杂性最高,​​但FARC-HD和FURIA生成的FRBC以及C4.5及其成本敏感版本计算出的决策树比IT2-PAES-RCS生成的FRBC精度低。最后,我们将通过展示和讨论其中之一来突出说明这些FRBC的理解方式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号